The Web in Theoretical Linguistics Research: Two Case Studies Using the Linguist’s Search Engine
نویسندگان
چکیده
0. Introduction The whisper of “does that sound ok to you?” is a familiar sound to most linguists: we often hear it in the audience when a presenter supports a theoretical point using a judgment of grammaticality or ungrammaticality, and someone in the audience disagrees with the judgment. In recent years a growing number of researchers have investigated the use of introspective judgments underlying work in theoretical syntax and in linguistics more generally. As Bard, et al. (1996) put it, each linguistic judgment is a “small and imperfect experiment.” Schütze (1996) and Cowart (1997) provide detailed discussions of instability and unreliability in such informal methods, which can lead to biased or even misleading results. Alternatives to introspective judgments include psychological methods (Bard et al. 1996), quantitative modeling in order to approximate judgments (Lapata et al. 2001), and broader reexamination of the idealizations underlying linguistic theory (Abney 1996; Sorace and Keller 2005). Tools for searching naturally occuring text potentially provide an additional window on linguistic data (Christ 1994; Corley et al. 2001; Blaheta 2002; Kehoe and Renouf 2002; König and Lezius 2002; Fletcher 2002; Kilgarriff, Evans et al. 2003), but typically these tools involve computational sophistication, restriction to a particular corpus, or shallow word-level search, making them less attractive to “the Ordinary Working Linguist without considerable computer skills” (Manning 2002). We designed the Linguist’s Search Engine (LSE) to empower ordinary working linguists to search the Web for linguistic data (http://lse.umiacs.umd.edu; Resnik and Elkiss 2005). The LSE’s architecture permits efficient search using syntactic and lexical criteria, with special attention to the needs of linguists, and
منابع مشابه
Towards Supporting Exploratory Search over the Arabic Web Content: The Case of ArabXplore
Due to the huge amount of data published on the Web, the Web search process has become more difficult, and it is sometimes hard to get the expected results, especially when the users are less certain about their information needs. Several efforts have been proposed to support exploratory search on the web by using query expansion, faceted search, or supplementary information extracted from exte...
متن کاملWeaving web data into a diachronic corpus patchwork
This paper offers a reassessment of the role of web data in diachronic linguistic analysis. We introduce the diachronic search facilities provided by the WebCorp Linguist’s Search Engine, including the use of a new ‘heat map’ graph for the analysis of changes in collocational patterns over time. We illustrate how web data can be used to supplement data from standard corpora in lexicological stu...
متن کاملA New Hybrid Method for Web Pages Ranking in Search Engines
There are many algorithms for optimizing the search engine results, ranking takes place according to one or more parameters such as; Backward Links, Forward Links, Content, click through rate and etc. The quality and performance of these algorithms depend on the listed parameters. The ranking is one of the most important components of the search engine that represents the degree of the vitality...
متن کاملThe Linguist's Search Engine: An Overview
The Linguist’s Search Engine (LSE) was designed to provide an intuitive, easy-touse interface that enables language researchers to seek linguistically interesting examples on the Web, based on syntactic and lexical criteria. We briefly describe its user interface and architecture, as well as recent developments that include LSE search capabilities for Chinese.
متن کاملQuery Architecture Expansion in Web Using Fuzzy Multi Domain Ontology
Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005